Factored Multi-Agent Soft Actor-Critic for Cooperative Multi-Target Tracking of UAV Swarms
نویسندگان
چکیده
In recent years, significant progress has been made in the multi-target tracking (MTT) of unmanned aerial vehicle (UAV) swarms. Most existing MTT approaches rely on ideal assumption a pre-set target trajectory. However, practice, trajectory moving cannot be known by UAV advance, which poses great challenge for realizing real-time tracking. Meanwhile, state-of-the-art multi-agent value-based methods have achieved cooperative tasks. contrast, actor-critic (MAAC) face high variance and credit assignment issues. To address aforementioned issues, this paper proposes learning-based factored soft (FMASAC) scheme under maximum entropy framework, where swarm is able to learn an unknown environment. This method introduces idea value decomposition into MAAC setting reduce policy updates efficient assignment. Moreover, further increase detection coverage swarm, spatial reward (SER), inspired concept, proposed scheme. Experiments demonstrated that FMASAC can significantly improve performance it outperforms baselines terms mean success rates. Additionally, scales more successfully as number UAVs targets increases.
منابع مشابه
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment, while policy gradient suffers from a variance that increases as the number of agents grows. We then present an adaptation of actor-critic methods that considers...
متن کاملDecentralized and Cooperative Multi-Sensor Multi-Target Tracking With Asynchronous Bearing Measurements
Bearings only tracking is a challenging issue with many applications in military and commercial areas. In distributed multi-sensor multi-target bearings only tracking, sensors are far from each other, but are exchanging data using telecommunication equipment. In addition to the general benefits of distributed systems, this tracking system has another important advantage: if the sensors are suff...
متن کاملDistributed Asynchronous Algorithm for Collaborative Multi-UAV Multi-Target Tracking
The use of mobile sensor networks for multi-target tracking is an active research field. This paper describes the application of a novel two-layer relaxation for high-level control and coordination of UAVs to optimally track a set of moving targets considering a limited resources scenario. Using a singlebest tracking criterion, the scheme deploys distributed algorithms constrained to ensure ful...
متن کاملCooperative Multi-robot Target Tracking
Target tracking performance can be improved by using multiple robot trackers, but this requires a coordinated motion strategy among the robots. We propose an algorithm based on treating the densities of robots and targets as properties of the environment in which they are embedded. By suitably manipulating these densities a control law for each robot is proposed. The proposed algorithm has been...
متن کاملMulti-Agent Cooperative Target Search
This paper addresses a vision-based cooperative search for multiple mobile ground targets by a group of unmanned aerial vehicles (UAVs) with limited sensing and communication capabilities. The airborne camera on each UAV has a limited field of view and its target discriminability varies as a function of altitude. First, by dividing the whole surveillance region into cells, a probability map can...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Drones
سال: 2023
ISSN: ['2504-446X']
DOI: https://doi.org/10.3390/drones7030150